"Less is More" in Bayesian Word Segmentation: When cognitively plausible learners outperform the ideal

نویسندگان

  • Lawrence Phillips
  • Lisa Pearl
چکیده

Purely statistical models have accounted for infants' early ability to segment words out of fluent speech, with Bayesian models performing best (Goldwater et al. 2009). Yet these models often incorporate unlikely assumptions, such as infants having unlimited processing and memory resources and knowing the full inventory of phonemes in their native language. Following Pearl, et al. (2011), we explore the impact of these assumptions on Bayesian learners by utilizing syllables as the basic unit of representation. We find a significant " Less is More " effect (Pearl et al 2011; Newport 1990) where memory and processing constraints appear to help, rather than hinder, performance. Further, this effect is more robust than earlier results and we suggest this is due a relaxing of the assumption of phonemic knowledge, demonstrating the importance of basic assumptions such as unit of representation. We argue that more cognitively plausible assumptions improve our understanding of language acquisition.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Utility of Cognitive Plausibility in Language Acquisition Modeling: Evidence From Word Segmentation

The informativity of a computational model of language acquisition is directly related to how closely it approximates the actual acquisition task, sometimes referred to as the model's cognitive plausibility. We suggest that though every computational model necessarily idealizes the modeled task, an informative language acquisition model can aim to be cognitively plausible in multiple ways. We d...

متن کامل

Recession Segmentation: Simpler Online Word Segmentation Using Limited Resources

In this paper we present a cognitively plausible approach to word segmentation that segments in an online fashion using only local information and a lexicon of previously segmented words. Unlike popular statistical optimization techniques, the learner uses structural information of the input syllables rather than distributional cues to segment words. We develop a memory model for the learner th...

متن کامل

Iranian EFL Learners' Processing of English Derived Words

An interesting area of psycholinguistic inquiry is to discover the way morphological structures are stored in the human mind and how they are retrieved during comprehension or production of language. The current study probed into what goes on in the mind of EFL learners when processing derivational morphology and how English and Persian derivational suffixes are processed. 60 Iranian EFL learne...

متن کامل

Taking the child’s view: Syllable-based Bayesian inference as a (more) plausible statistical word segmentation strategy

Because knowledge of words plays a crucial role in acquisition and children seem to accomplish word segmentation very early (~7.5 months (Jusczyk et al., 1999; Echols et al., 1997; Jusczyk et al., 1993a)), many strategies have been proposed for how children learn to identify words in their native language. Because of experimental evidence that infants are sensitive to statistical information in...

متن کامل

How Ideal Are We? Incorporating Human Limitations into Bayesian Models of Word Segmentation

1. Introduction Word segmentation is one of the first problems infants must solve during language acquisition, where words must be identified in fluent speech. A number of weak cues to word boundaries are present in fluent speech, and there is evidence that infants are able to use many of these, including phonotactics However, with the exception of the last cue, all these cues are language-depe...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012